BIOTECHNOLOGY ez*4^ ,, : >_ , 

RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 0 



Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212, 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 





Raw Sequence Listing Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

1 Wrapped Nucleics 

Wrapped Aminos 



SUGGESTED CORRECTION SERIAL NUMBER: 

PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



_lnvalid Line Length The rules require that a line not exceed 72 characters in length. *fhis includes white spaces. 



3 ■ Misaligned Amino 

Numbering 

4 Non-ASCII 



II 



12 



Variable Length - 



_PalentIn 2.0 
"bug" 



(NEW RULES) 



_Use of n*s or Xaa's 
(NEW RULES) 



lOjU Invalid <213> 

Response 



Useof<220> 



_PatentIn 2.0 
"bug" 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved in ASCII text. 

Sequencers) / ^ contain n's or Xaa's representing more than, one residue. Per Sequence Rules, 
each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused Afe <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would' automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



_Skipped Sequences Sequences) 
(OLD RULES) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X** is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION :SEQ ID NO:X: (insert SEQ*ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 



8 Skipped Sequences' Sequences) _ 



missing. If Intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per L823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
sptentificname (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequences) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown-" Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register, "06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 . Misuse of n *" h can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 
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OIPE 



RAW SEQUENCE LISTING DATE: 
PATENT APPLICATION: US/09/911,261 TIME: 

Input Set : A:\109845-130.txt 

Output Set: N:\CEF3\08012001\I911261.raw 



08/01/2001 

15 :07 £^es Not Comply 
Corrected Diskette Needed 



P 1 



2 <110> APPLICANT: Sera, Takashi 

4 <120> TITLE OF INVENTION: Zinc Finger Domain Recognition Code and Uses Thereof 

6 <130> FILE REFERENCE: 109846-130 

C--> 8 <140> CURRENT APPLICATION NUMBER: US/09/911,261 

C--> 8 <141> CURRENT FILING DATE: 2001-07-23 

8 <150> PRIOR APPLICATION NUMBER: US 60/220,060 

10 <151> PRIOR FILING DATE: 2000-07-21 

12 <160> NUMBER OF SEQ ID NOS : 69 

14 <170> SOFTWARE: Patentln version 3.0 



ERRORED SEQUENCES 



ammo 



16 
17 
18 
19 
21 
22 
24 
25 
26 
27 

28 
30 
31 
32 
33 

34 
36 
37 
38 
39 

acids 

40 

E--> 42 
W--> 44 

45 

W--> 47 

48 
51 
52 
53 
54 
56 
57 



CD 



<210> SEQ ID NO: 1 
<211> LENGTH: 28 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Zinc finger domain. 

<220> FEATURE: 

<221> NAME/KEY: VARIANT 

<222> LOCATION: (1)..(28) ^ / C 

<223> OTHER INFORMATION: Amino acids 1-3, 8-19 and 25-28 are Xaa wherein Xaa = any 

acid. 
<220> FEATURE: 
<221> NAME/KEY: VARIANT 
<222> LOCATION: (5).. (6) 

<223> OTHER INFORMATION: Amino acid 5 is Xaa wherein Xaa = any amino acid, amino acids 




and ^Together represent from 2 to 4_amino acids in length 
<220> V >ESTO 

<221> NAME/KEY : VARIANT 

<222> LOCATION: (21).. (23) - - - 

<223> OTHER INFORMATION: Amino acid 21 is Xaa wherein Xaa = any amino acid, amino jU^&U- 



7~s> 0^ * 



2- 3- - to g^fcher represent . from 3 t 
<40£> SEQUENCE ^J^SL^^ P~ V 



Xaa Xaa xaa Cy 

1 / r' 
Xaa Xaa Xaa His 




mino acids in length. 



yd Xaa Xala Xaa Xa4 Xaa Xa£ XaTa Xda Xaa 
^ 10 / 15 

Xaa) His Xaa Xaa" Xaa Xaa 

25 



j 

JLAai\J 



<210> SEQ ID NO: 2 
<211> LENGTH: 28 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Zinc finger domain. 



(fy\ iWi/ 



file://C:\Crf3\Outholcf\VsrI911261.htm 



8/1/01 



59 <220> FEATURE : 

60 <221> NAME/KEY: VARIANT 



file://C:\Crf3\Outhold\VsrI91 1261.htm 
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RAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/911 , 261 TIME: 15:07:46 

Input Set : A:\109845-130.txt 

Output Set: N:\CRF3\08012001\I911261.raw 

61 <222> LOCATION: <1)..(28) ^ ^~ T / ^ 

63 <223> OTHER INFORMATION: Amino acids 1-3, 8-12, 14, 17-18 and 25-28 are Xaa wherein 

Xaa = 

64 any amino acid. 
6 6 <220> FEATURE: 

67 <221> NAME/KEY: VARIANT 

68 <222> LOCATION: (5).. (6) 

£ q ot hrp INFORMATION: Amino a,cid 5 is Xaa wherein Xaa = any A/iinp acid, amino acids 




HER INFORMATION: Amino acid 21 is Xaa wherein Xaa - any amino acid, amino 

acids f) 

21-23 together represent from 3 tof 5 Amino acids in length. 

FEATURE : ^/jL*^ f 



Lys, 



Arg, 



Lys, 



70 


72 


<220> 


73 


<221> 


74 


<222> 


75 


<223> 


76 




78 


<220> 


79 


<221> 


80 


<222> 


81 


<223> 


82 




84 


<220> 


85 


<221> 


86 


<222> 


87 


<223> 


88 




90 


<220> 


91 


<221> 


92 


<222> 


93 


<223> 


94 




96 


<220> 


97 


<221> 


98 


<222> 


99 


<223> 


100 


102 «40CT 




and ^/together represent from 2 



is Xaa wherein Xaa = any amino 
4 Jamino acids in length. 



LOCATION: (13).. (13) ^ 

OTHER INFORMATION: Amino acid 13 is Xaa wherein : 
Gin or Asn, Thr, Met, Leu or lie, or Glu or Asp. 



Asn Gin, Thr, Val or Ala, or Asp or Glu. 
FEATURE : 

NAME/KEY: VARIANT ^ 
LOCATION: (16) . . (16) 

OTHER INFORMATION: Amino acid 16 is Xaa wherein X< 
Asn or Gin, Ser, Ala, or Val, or Asp or Glu. 



i r or &cm r jTh-r . Tyr, Leu, He or Met, or Glu or Asp. 




Lys, 



1051/ f y >-^*zr-r s ■ r x f c "T 15 ' 

W--> 107 Xaa Xaa Xaa His Xaa Xaa Xaa His Xaa Xaa Xaa Xaa 

108 20 25 

111 <210> SEQ ID NO: 3 

112 <211> LENGTH: 196 

113 <212> TYPE: PRT 

114 <213> ORGANISM: Artificial Sequence 

116 <220> FEATURE: 

117 <22> > OTHER -XHEQEMAT ION: Zinc finger protein. 
E--> 119<<400> SEQUENCeTjTV? & 

121 V al Px ' O He Pio ^gty Lys Lys Lys Gin His He Cys His He Gin Gly 



file://C:\Crf3\Outhold\VsrI9 1 1 26 1 .htm 



8/1/01 
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122 15 10 15 

124 Cys Gly Lys Val Tyr Gly Gin Ser Ser Asp Leu Gin Arg His Leu Arg 

125 20 25 30 
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RAW SEQUENCE LISTING DATE: 08/01/2001 

PATENT APPLICATION: US/09/911 , 261 TIME: 15:07:46 



Input Set : A:\109845-130.txt 

Output Set: N:\CRF3\08012001\I911261.raw 



127 


Trp 


His 


Thr 


Gly 


Glu 


Arg 


Pro 


Phe 


Met 


Cys 


Thr 


Trp 


Ser 


Tyr 


Cys 


Gly 


128 




35 










40 










45 








130 


Lys 


Arg 


Phe 


Thr 


Arg 


Ser 


Ser 


Asn 


Leu 


Gin 


Arg 


His 


Lys 


Arg 


Thr 


His 


131 




50 










55 










60 










133 


Thr 


Gly 


Glu 


Lys 


Lys 


Phe 


Ala 


Cys 


Pro 


Glu 


Cys 


Pro 


Lys 


Arg 


Phe 


Met 


134 


65 










70 










75 










80 


136 


Arg 


Ser 


Asp 


Glu 


Leu 


Ser 


Arg 


His 


He 


Lys 


Thr 


His 


Gin 


Asn 


Lys 


Lys 


137 








85 










90 










95 




139 


Asp 


Gly 


Gly 


Gly 


Ser 


Gly 


Lys 


Lys 


Lys 


Gin 


His 


He 


Cys 


His 


He 


Gin 


140 








100 










105 










110 






142 


Gly Cys 


Gly 


Lys 


Val 


Tyr 


Gly 


Thr 


Thr 


Ser 


Asn 


Leu 


Arg 


Arg 


His 


Leu 


143 






115 










120 










125 








145 


Arg 


Trp 


His 


Thr 


Gly 


Glu 


Arg 


Pro 


Phe 


Met 


Cys 


Thr 


Trp 


Ser 


Tyr 


Cys 


146 




130 










135 










140 










148 


Gly 


Lys 


Arg 


Phe 


Thr 


Arg 


Ser 


Ser 


Asn 


Leu 


Gin 


Arg 


His 


Lys 


Arg 


Thr 


149 


145 










150 










155 










160 


151 


His 


Thr 


Gly Glu 


Lys 


Lys 


Phe 


Ala 


Cys 


Pro 


Glu 


Cys 


Pro 


Lys 


Arg 


Phe 


152 










165 










170 










175 




154 


Met 


Arg 


Ser 


Asp 


His 


Leu 


Ser 


Arg 


His 


He 


Lys 


Thr 


His 


Gin 


Asn 


Lys 


155 








180 










185 










190 






157 


Lys 


Gly 


Gly 


Ser 


























158 






195 





























file://C:\Crf3\Outhold\VsrI9 1 1 26 1 .htm 



<4oo£) i ^^^^^yj^_^ ( f n> 

Xaa Xaa Xaa Cys Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 




Use of n and/or Xaa has been detected in the Sequence Listing. 
Review the Sequence Listing to insure a corresponding 
explanation is presented in the <220> to <223> fields of 
each sequence using n or Xaa. 



<211> 45 
<212> DNA 



<|13> Artificial Sequence (^CR Primer) fl^ ^ Z2Z3?X^c) hoT^Z/Z? 



ttcagggcgg tctctcggct tctcgccagt gtgagtacgc tgatg / 

JU) 



• «?/?//, 2W 



<400> 69 



Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Xaa Ser Xaa Xaa 
15 10 15 

Leu Ser Xaa His Gin Arg Thr His Thr Gly Glu Lys 

20 25 

/l09845-130/s?2^\^ 
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L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 



VERIFICATION SUMMARY DATE: 08/01/2001 

PATENT APPLICATION: US/09/911, 261 TIME: 15:07:47 

Input Set : A:\109845-130.txt 

Output Set: Ni \CRF3\08012001\I911261 . raw 

8 M:270 C: Current Application Number differs, Replaced Current Application No 
8 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

42 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS : 1 differs :0 
(46) "n" or "Xaa" used, for SEQ ID# : 0 
(46) "n" or "Xaa" used, for SEQ ID# : 0 

34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS : 2 differs :0 
"n" or "Xaa" used, for SEQ ID# : 0 
"n" or "Xaa" used, for SEQ ID# : 0 

Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS : 3 differs :0 



44 M:341 W 
47 M:341 W 
102 M:212 E 
104 M:341 W 
107 M:341 W 
119 M:212 E 
559 M:341 W 
562 M:341 W 
801 M:341 W 
835 M:341 W 
852 M:341 W 
870 M:341 W 
887 M:341 W 
904 M:341 W 
922 M:341 W 
939 M:341 W 
956 M:341 W 
973 M:341 W 
991 M:341 W 
1009 M:341 W 



46 
46 
34 
46 
46 
46 
46 
46 
46 
46 
46 
46 
46 
46 
46 



"n" 

»n" 

"n" 
,i n ii 

"n" 
„ n t, 

"n" 
"n" 
"n f 
"n r 

"n' 
"n' 



or "Xaa" 
or "Xaa" 
or "Xaa" 
or "Xaa" 
or "Xaa" used, 
or "Xaa" used, 
or "Xaa" used, 



used, 
used, 
used, 
used, 



for SEQ ID# : 13 
for SEQ ID# : 13 
for SEQ ID#:30 
for SEQ ID# : 31 
for SEQ ID#:32 
for SEQ ID#:33 
for SEQ ID# : 34 



or "Xaa" used, for SEQ ID#:35 
or "Xaa" used, for SEQ ID#:36 



or "Xaa" used, 

or "Xaa" used, 

or "Xaa" used, 

or "Xaa" used, 



for SEQ ID#:37 
for SEQ ID#:38 
for SEQ ID#:39 
for SEQ ID#:40 



46] 

„ ... (46) "n" or "Xaa" used, for SEQ ID#:41 

1151 M:258 W: Mandatory Feature missing, <220> FEATURE: 

1151 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION: 

1348 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:68 

1351 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:68 

1389 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:69 

1392 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:69 
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